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Abstract — Functional dependence graphs (FDGs) are an im- 
portant class of directed graphs that capture the functional 
dependence relationship among a set of random variables. FDGs 
are frequently used in characterizing and calculating network 
coding capacity bounds. However, the order of an FDG is usually 
much larger than the original network and the complexity 
of computing bounds grows exponentially with the order of 
an FDG. In this paper, we introduce graph pre-processing 
techniques which deliver reduced FDGs. These reduced FDGs 
are obtained from the original FDG by removing nodes that 
are not "essential". We show that the reduced FDGs give the 
same capacity region/bounds obtained using original FDGs, but 
require much less computation. The application of reduced FDGs 
for algebraic formulation of scalar linear network coding is also 
discussed. 

Index Terms — network coding capacity, linear programming 
bounds, functional dependence graphs, complexity reduction, 
algebraic formulation. 

I. Introduction 

Characterizing the capacity region of network coding is an 
important fundamental problem. For single session networks, 
it is well known that the capacity region is given by the max- 
flow bound \\\. However, the max-flow bound is no longer 
tight when multi-session networks are considered. It is shown 
in (6) that the exact capacity region for general networks can 
be written as a function of entropy region, T*, and some 
constraints induced by the network topology. Unfortunately, 
characterization of T* is still open for n > 4 and infinite 
number of information inequalities need to be considered (7). 
An explicit outer bound in terms of the set of polymatroidal 
functions, T n is referred as Linear Programming (LP) bound 

GO 

Functional dependence graphs (FDG) are a class of directed 
graphs that capture the functional dependence relationship 
among a set of random variables. FDGs are first used in (2) 
to establish conditional independence among random variables 
involved in a communication system which is useful to char- 
acterize bounds on the capacity of the communication system. 
Variants of FDGs are used in [ 3 ] and |5 ] to characterize 
computable outer bounds on multi-session network coding 
capacity. FDG plays an important role in studying the capacity 
for networks with edge capacity constraints. Besides the 
progressive ^-separating edge-set bound (3j and the functional 
dependence bound [5 ] which are obtained using FDGs, many 
other capacity bounds e.g. (4), and the LP bound are closely 



related with FDGs of the network. The random variables and 
the functional dependence constraints which are important 
in computing the LP bound of a given network are directly 
reflected by FDGs. 

In this paper, we introduce graph pre-processing techniques 
to reduce the size of an FDG by removing edges and nodes that 
are not essential. The resulting FDG is refereed as Reduced 
Functional Dependence Graph (FDG) which is of smaller size 
but preserves important properties of the original network. We 
show that the capacity of the original network can totally be 
determined by the reduced FDGs. As a result, reduced FDGs 
can be used to compute bounds on the network coding capacity 
of a given network with lesser computational resources. More- 
over, removing nodes in the original FDGs is equivalent to 
removing the random variables involved in computing the LP 
bound of a given network. Hence the complexity of computing 
LP bound reduces exponentially using the notion of reduced 
FDGs. 

Due to the similarities between an FDG and the directed 
line graph (9) of a given network, an FDG can also be 
used to construct the algebraic formulation of scalar linear 
network coding. With the proposed reduced FDG, the number 
of variables required for such formulation will be reduced and 
the complexity for computing the transition matrix can also 
be significantly reduced. 

The rest of the paper is organized as follows. In section II, 
network model and formal definition of an FDG are given. 
Graph pre-processing techniques leading to reduced FDGs 
are given in section III. Applications of the main results are 
discussed in section IV, including complexity reduction for 
computing LP bound and complexity reduction for algebraic 
formulation of scalar linear network coding. Finally, the paper 
is concluded in section V. 

II. Background 

A. Network Model 

A network is represented by a directed acyclic graph 
Q = (V,£), where V is the set of nodes and 6 is the set 
of edges. For an edge E = (V, U) G E, define Tail(E') = V 
and Head(£?) = U. For V G V, the set of edges entering 
into V and leaving V are denoted by In(V) and Out(V) 
respectively. To simplify the description, we also define the 
set of edges entering into and leaving an edge E as ln(E) = 
{£" : Head(£') = Tail(^)} and Out(E) = {E f : Tail(£') = 



Head(£?)} respectively. It is easy to see that if E = (V, U), 
In(E) = In(V) and Out(E) = Out(U). 

Let S = {1, • • • , |«S|} denote the set of independent infor- 
mation sources available at some nodes (called source nodes) 
in a network via mapping a : S i— » V. The sources are 
demanded by some nodes in a network called sink nodes. A set 
of sources demanded by a given sink node is described by the 
mapping b : V (-» V(S) where, V(A) is the power set of the 
set A. Let T denote the set of sink nodes in the network and 
thus the set of sources demanded by the sink T G T is b(T). 
If each source is demanded by exactly one sink, the network 
is called multiple-unicast network. Without loss of generality, 
we assume that In(5) = Out(T) = 0,VS G S,T G T. For 
E e £, C E denotes the maximal rate that can be conveyed 
through the link E. 

Definition 1: For a given network Q = (V, £), an informa- 
tion rate tuple R = : 5 G 5) is achievable if there exists 
a network code C = {f E : £ G £,# T : T G T} of block 
length n, defined by 

• For all S G <S,£? G Out (5), local encoding functions 

. For all V G V \ (5 U T),£ G Out(V) local encoding 
functions : n^'einOO 

• and for all T G 7 , decoding functions #t • 

such that 

lim n -1 log |34 n) | = lim n^HfY^) > R s 
lim n^HiU^) < lim n _1 log|wL n) | < C E 

n— )-oo n— )>oo 

and for all T G T 

lim Pr( 5T (/7' n) : £ G In(T)) ^ (M n) : 5 G 6(T))) = 0. 

n— )-CO 

The closure of the set of all achievable rate tuples for a 
given network is called the capacity of the network. 

B. Functional Dependence Graph 

Given a network Q — (V, £ ) with sets of random variables 
{Ys : S G S} and {U E : E E £} representing the information 
generated by sources and carried by edges respectively, a 
valid network code for the network satisfies the following 
constraints. 
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Note that constraint ^ specifies the independence of the 
sources, ^ and ^ give encoding constraints, ([5]) corresponds 
to the decoding requirements, ([6J means that the entropy of 
variables carried by any link cannot exceed the link capacity. 



Using the encoding and decoding constraints ([3])-([5} for a 
given network Q (i.e., local functional dependence constraints 
at nodes), an FDG can be constructed for the set of source 
and edge random variables. FDG is defined in [5 ] as: 

Definition 2 {Functional dependence graph Ipy): Let V be 
a set of random variables. A directed graph Q = (V,<?) is 
called a functional dependence graph for V if and only if for 
all V G V 

H(V\{U:(U,V)eE})=0. (7) 

Q is a directed cyclic graph and of the order |V| = \£ \ + |<S|. 
£ is determined by the dominance relationship defined in ([3])- 
(5). For V G V, let Down(F) = {V : (V, V) G £} be the set 
of immediate downstream nodes (also known as children) of 
V and Up(V) = {V : (V, V) G £} be the set of immediate 
upstream nodes (also known as parents) of V. Thus, for S G S, 
Down(F 5 ) = Uout(S) and if b(T) = 5, Up(Y s ) = U ln(T) . 
Moreover, for E G £, Dowu(Ue) = Uo ut ( E ) and Up (C/e) = 
Uin(E) • Provided that the communication network Q is directed 
acyclic, every node Ys, S G S in Q must be a member of a 
cycle due to the decoding constraints ([5J. 

III. Main Results 
A. Reduced Functional Dependence Graph 

For a given network 5 = (V, £), if it is a connected graph, 
|£| ^ |V| — 1. Therefore the order of an FDG of the given 
network, |V| = \£\ + is usually much larger than the 
original network. As the complexity of computing network 
capacity bounds usually grows exponentially with the size of 
an FDG, it is desirable to reduce the size of it. 

Source random variables Ys, S G S may be regarded 
as primary random variables introducing randomness into the 
network. Edge random variables Ue,E g £ may be regarded 
as secondary random variables since all edge random variables 
are function of the source random variables. In this section we 
focus on eliminating certain edge random variables (and hence 
corresponding nodes in FDGs) such that the capacity of the 
network remains unchanged. 

Definition 3: Given a network Q — (V,£), let E = 
(V, U) G £, the reduced network Q\E = (V ', £') is defined 
by: V = V and £ ' = £\E. Moreover, the information carried 
by edges in ln(E) are made available to both node V and U, 
i.e. for K G Tn(E), Head(K) = {U, V}. 

The procedure for obtaining Q\E from Q is shown in Fig[T] 
Note that the reduced network Q \ E is no longer a traditional 
network since some edges have more than one destination 
nodes. However, the definition of capacity still applies. 

Although the transformation from Q to Q \ E makes the 
network look more complex, it reflects a reduction in their 
corresponding FDG sub-networks (referred as Q and Q\Ue 
respectively) as shown in Figj2j 

Definition 4: Given an FDG Q = (V, £), a reduced FDG 
G\U E = (V, £') is given by V = V \ U E and 

£ ' = {£ \ {E G £ : Head(^) = U E or Tail(E) = U E }} 
U {(A, B) : A e Vp(U E ), B G Down^)}. 




(a) Network G (b) Network G \ E 



Fig. 1. Construct new network by removing edge E 




Down(C/ £ ) " Down(£/ £ ) 

(a) FDG G (b) FDG G\U E 



Fig. 2. Construct new FDG by removing Ue 

Theorem 1: Network Q and Q\E have the same capacity 
if the following conditions are satisfied: 

1) Tail(^) S 

2) C E > YsE'-.E'eME) C E' 

Proof: Assume that an information rate tuple R = (Rs : 
S G <S) is achievable in the original network 5- According to 
Definition [T] there exists a network code C, composed of a 
set of encoding functions {/e - E e S} and a set of decoding 
functions {gr ' T G T}, with which the communication 
requirement is satisfied. Given C, we can easily construct a 
network code C' for the new network Q \ E via the following 
modification: 

• excluding the encoding function for the removed edge E 
. if Head(£) T, for edge K G Out(£), let AT = In(K)\ 
E be the set of incoming edges to the edge K except 
E. Replacing fx with a new composite function f' K = 

. if Head(£) = T G T, let Af = In(T) \ J5. Changing # T 

to ^ = ^(^(n^ W )4^),nx^4 n) ) 

With above modification, if Head(£) ^ T, C = (C \ 
{/is, {/*, K G In(E)}}) U {/jj and if Head(£) = T G T, 
C' = (C\{/#,#t})U{<7^}. Note that the encoding function 
f K defines a mapping: ]J X e x • Yl E >ein(E) 
which produces the same output as fx in the original network 
code C. Similar arguments hold for the decoding function g' T 
if Head(£) = T G T. Therefore, the rate tuple R = (ife : 
5 G 5) is achievable in (/ \ E with the network code C' . 



On the other hand, assume that a rate tuple R = (Rs : 5 G 
5) is achievable in (/ \ .E. There exists a network code C 7 , 
with which the communication requirement can be satisfied 
in Q\E. Based on C' , we can construct the network code C 
for the original network if the conditions stated in this theorem 
are satisfied. 

Condition 1) states Tail(E') S and Condition 2) states 
that capacity of edge E is larger than or equal to sum capacity 
of all edges coming into node V. Combined both conditions, 
we know that edge E is capable of forwarding all received 
data without any compression. Therefore, we can set the 
encoding function at edge £ as a simple replication, i.e. 

4 n) = MrWin^z^) = iw^wi- Thus > the 

network code C = C' U Note that this construction does 
not violate any encoding constraint for downstream edges of E 
since the same information is available to them for encoding as 
in the reduced FDG. Therefore, all the variables in the original 
network are the same as that in the reduced network and thus 
decoding requirements are satisfied. ■ 

Due to the correspondence of network and its FDG, The- 
orem [T] can be applied to reduce the size of FDG. Before 
introducing the corollary of Theorem [T] in FDG, the definition 
of redundant random variable is made. 

Definition 5: A node U E (corresponding to the edge ran- 
dom variable Ue) in a functional dependence graph Q = 
(V, E) is called redundant if the capacity region of the reduced 
FDG Q\Ue is the same as the original FDG Q. 

Corollary 1: A node Ue G (V \ Ys) is redundant if the 
following conditions are satisfied: 

1) Y s n Up(C^) = 

2) C E > T,A:U A ev P (u E ) °a 

Corollary [T] follows from Theorem [T] and Definition [5] A 
simple extension of Corollary [T] from single edge variable to 
a group of edge variables gives the following corollary. 

Corollary 2: A set of nodes Ux C (V \ Y$) is redundant 
if the following conditions are satisfied: 

1) \Jp(U El ) = Vp(U E2 ), ve u e 2 g X 

2) Down([/ Bl ) = Down([/£ 2 ), \/E 1 ,E 2 € X 

3) Y S n Vp(U x ) = and E E& x Ce > Eu Ae v P (u x ) Ca 
The proof of Corollary [2] follows by treating the set of variables 
Ux as a single "super- variable". Note that the redundant 
variables identified by Corollary [T] and Corollary [2] correspond 
to the edges in original network where network coding is not 
necessary since they have sufficient capacity to forward all 
received information to their children. 

In practise, most networks use packet-based transmissions. 
Therefore, we can assume unit-edge capacity. The edges that 
have large capacity (convey multiple packets) can be repre- 
sented by multiple parallel edges of unit capacity. Based on 
this assumption, Corollary 1 and Corollary 2 can be simplified 
as follows. 

Corollary 3: For Ue G (V \ Ys), it is redundant if 
|Up(£^)| = 1 and Up(^) H Y s = 

Corollary 4: Let Ux C (V\Xs), they are redundant if the 
following conditions are satisfied. 



1) Up(U El ) = Up(U E2 ), ve u e 2 e X 

2) Down(f/ Bl ) = Down([/ B2 ), \/E u E 2 € X 

3) XJp(U x ) C\Y S = and |Up(tf*)| < \U X \ 

Example 1: An example of reduced FDG is shown in Fig(3] 
The network in Fig[3Ja) is constructed in [10] and each edge 
in the network has unit capacity. Fig[3jc) gives the FDG 
constructed based on Definition [2] This FDG can be reduced 
to the one shown in Fig[5Jb) based on Corollary [3] 



*1 Y 2 




(c) 



Fig. 3. (a) A network in [lOj (b)The Reduced FDG (proposed) (c)The 
conventional FDG 

B. Further Reduction of FDG 

It has been proved in fTT) that linear network coding is 
not sufficient to achieve the capacity for general networks. 
However, in most practical scenarios, linear network coding 
is preferred due to its simplicity. With linear network coding, 
the data carried by an edge E is a linear combination of input 
data available at Head(£?). Without loss of generality, we may 
assume that each edge has unit capacity and the encoding 
function at edge E is described by (9): 

Je -Ue = ^2 £ e,e'Ue' 

E':E'eln(E) 

where the coefficients Se,e' are elements chosen from de- 
signed alphabet and H(U E ) <l,VEe £. 



Theorem 2: Assume each edge in the network has unit 
capacity, then the network Q and Q\E (refer to Fig [I]) have the 
same linear network coding capacity if one of the following 
conditions is satisfied: 

1) Tail(^) (£S and |In(^)| = 1 

2) Head(£) T and \Out(E)\ = 1 

Proof: As shown in the proof of Theorem [T] if a rate 
tuple R = (Rs : S G S) is achievable in the Q, it is always 
achievable in Q \ E with the same linear network code after 
simple modification. 

On the other hand, if a rate tuple R is achieved in Q\E with 
a linear network code C', we will show that a linear network 
code C can be constructed for the original network Q with 
given conditions. 

For K G Out (£7), let X = In(K) \ E. Since C' is a linear 
network code, the encoding function for edge K is of the form: 

fx 'U K = Y,E:Eeln(K) £ E,KU E = Y.E' eln(E) £ E' ,kU E > + 

^2xex £ x,kUx- If condition 1) is satisfied, C is constructed 
from C' by letting edge E perform simple forwarding, i.e./# : 
Ue — Uj n (E)- If condition 2) is satisfied, the encoding for edge 
E can be set to f E :U E = 52 E 'ein(E) £ e>,kU e >- For the edge 
K = Out (E), the encoding function is changed to fx :Uk = 
Ue + ^Zxex £ x,kUx- For either case, the constructed code 
C achieves the same rate tuple R = (Rs : S G S) since all 
edges carry the same information as that in Q \ E. ■ 

Applying Theorem [2] in FDG results further reduction 
described by the following corollary. 

Corollary 5: When linear network coding capacity is in- 
terested, a node Ue G (V \ Ys) is redundant if one of the 
following conditions is satisfied: 

1) |Up(£^)| = 1 and Y s n Up(C^) = 

2) |Down(L^)| = 1 and Y s n T>own(U E ) = 

Note that the extension to group nodes is straightforward 
and thus omitted here due to space limitation. 

Example 2: Consider the network shown in Fig|4ja) which 
appeared in |T2| and has unit edge capacity. The corresponding 
FDG of this network is shown in Fig|4jb). By Corollary [T] it 
is reduced to |4jc) and it can be further reduced to the one 
in FigQd) according to Corollary [5] if only linear network 
coding capacity is interested. With the reduced FDG, it is easy 
to show that H(Y\,Y2\U%) = 0. Thus the sum capacity that 
can be achieved with linear network coding is bounded by: 
Ri + R2 < H(U$) < 1, which is the same as the capacity 
achieved by routing. 

IV. Applications of Reduced FDG 

Although the concept of reduced FDG is very simple and 
the procedures to obtain reduced FDG are straightforward, the 
range of its application is wide and two of them are discussed 
in this section. 

A. Complexity Reduction for Computing LP Bound 

The Linear Programming (LP) bound is the set of all rate 
tuples satisfying (p}-([6]) together with the basic inequalities. 




(c) (d) 



Fig. 4. (a) A network taken from [12] (b)The conventional FDG (c)The 
reduced FDG (proposed) (d)The further reduced FDG 

The basic inequalities are non-negativity of entropy, condi- 
tional entropy and conditional mutual information. The set of 
the elemental basic inequalities 

H(A\{Y S , U s } \ A) > 0, A e {Y s , U £ } (8) 
I(A; B\C)>0,A^B^®,A,Be {Y s , U s }, 

CC{Y s ,Us}\{A,B} (9) 

represents (implies) all Shannon-type inequalities for the ran- 
dom variables Y$,Us and is minimal (8). Note that the 
constraints ([l}-([9]) are linear and hence the LP bound can be 
computed by solving the linear program 

max wsH(Y s ) subject to @ - ([6J, (|S) - (10) 

ses 

where ws is any non-negative constant for source S called 
weight coefficient. 

For every set of chosen weight, the dimension of this 
optimization problem is 2^ — 1, where N = \£\ + |<S| and 
the number of constraints is N + { N 2 )2 N - 2 + 1 + 2\£\ + \T\, 
including N + {^)2 N ~ 2 elemental information inequalities, 
1 equality representing source independence, \£\ equalities 
for encoding requirements, another \£\ inequalities capturing 
the edge capacity constraints and \T\ equalities stating the 
decoding requirements. 

Note that the dimension of this optimization problem grows 
exponentially with the order of its FDG, N. Therefore, any 
reduction of FDG size results in exponential reduction of the 
problem dimension. The reduction in the number of constraints 



is more significant since it grows even faster with the size of 
FDG. 

Example 3: (Butterfly Network): Consider the Butterfly net- 
work shown in Figj5ja). If the original FDG shown in Fig|5jb) 
is used. The number of variables involved is N = 9 and the 
dimension of the linear program is 511. A matrix representing 
the constraint set is of size 4634 x 511. With the reduced 
FDG shown in Fig|5jc), N reduces to 5, thus the problem 
dimension and the constraint size reduces to 31 and 91 x 31 
respectively. The overall complexity reduction exceeds 99%. 
Combining the reduced FDG with other complexity reduction 
techniques shown in (13), the dimension of the LP bound for 
Butterfly network can be further reduced to 8. 




(c) 



Fig. 5. (a)Butterfly Network (b)The reduced FDG (c)The conventional FDG 

B. Complexity Reduction in Algebraic Formulation of the 
Scalar Network Coding 

The directed line graph is introduced in (9J to construct 
an algebraic formulation of the scalar network coding. For a 
network Q = (V, £), let its line graph be denoted by Q — 
(V, £) and its FDG be denoted by Q = (V, £). Each V G V 
corresponds to an edge E G £ and (Vi, V2) G £ if and only if 
Head(Vi) = Tail(l^) in Q. Compare it with the definition of 
FDG, it is not difficult to conclude that Q is a subgraph of Q. 
Therefore, Q can be used to construct the algebraic formulation 
as well. 

Following the similar notation used in [9 ], let A be the 
matrix that maps the source to its out-edges, F be adjacent 
matrix of Q and B be the matrix that maps in-edges of the 
receiver to decoded symbols. The transition matrix of the 
network is formulated as M = A(I - F^B |9|. The total 
number of variables involved in this formulation is \£\ and 
adjacent matrix F is of the size |V| x |V|. Let Ay and A# 
denote the amount of nodes and edges removed from the 



M = A(I-F)~ 1 B = 



a 4 76 + ^3/^275 
«576 + «675 



«2/^l73 + «2ft>74 

(a 3 /3i + ^4/^3)73 + ^3^274 

^5/^373 + «674 



«l7l + ^2^172 

(a 3 /3i +a 4 /3 3 )72 
^5/^372 



(11) 



original FDG respectively. Therefore, with the reduced FDG, 
the variables involved in the formulation is reduced by A# 
and the reduction in complexity for computing the transition 
matrix is proportional to Ay. 

Example 4: Fig[6ja) shows a network taken from [11] that 
is solvable only over fields with characteristic 2 and Fig|6jb) 
shows the corresponding reduced FDG. By labeling each 
edge in the reduced FDG, we can formulate the scalar linear 
network coding as: 



a± ct2 

CX3 &4 

as cxq 



B T — 






71 



76 75 
73 74 
72 








Pi 
#3 
















The transition matrix is given in ( [TT] ). By setting the 
diagonal element to be 1 and off-diagonal element to be 0, it 
can easily be proved that solution exists only if all the variables 
are chosen from a field with characteristic 2. Note that the 
total number of variables used in this formulation is 15 and 
the size of F is only 5x5. However, if the original line graph 
is used, the number of variables is 28 and the dimension of 
the adjacent matrix is 18 x 18. Therefore, the total amount of 
reduction in problem dimension is 46% and in complexity is 
92%. 




Fig. 6. A network that is solvable only over fields with characteristic 2 

V. Conclusions 

In this paper, reduced FDG is obtained from the original 
FDG by removing all the edges where network coding cannot 



be performed and where network coding is not necessary. With 
reduced FDG, various capacity bounds of the network can 
be computed much more efficiently without any loss in the 
tightness. Two applications of reduced FDG are discussed in 
this paper, one is in the computation of LP bound and the other 
is in algebraic formulation of scalar linear network coding. In 
both cases, the reduced FDG significantly reduces the problem 
size and complexity. Note that the applications of the reduced 
FDG is not limited to the areas discussed in this paper. It may 
also help to identify the encoding complexity and simplify the 
code construction. 
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